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(54) Power saving an a data processing circuit 

(57) A data processing circuit (201 ) is shown having 
elements (315, 310) which operate in response to de- 
coded instruction while receiving clocking signals. In- 



struction types are identified and clocking signals to at 
least one of the elements is enabled or disabled in de- 
pendence upon whether the element is required for the 
execution of the identified instruction type. 
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Description 

The present invention relates to a data processing 
circuit having a plurality of elements, wherein said ele- 
ments operate in response to coded instructions while s 
receiving clocking signals. 

Data processing environments are known in which 
elements operate in response to decoded instructions 
while receiving clocking signals. A clocking signal is of- 
ten generated by a quartz oscillator or similar and the 10 
signals are distributed throughout the circuit so as to 
provide a common reference for manipulations per- 
formed within the elements. The overall processing 
speed of a device may be increased by increasing the 
frequency of the clock although an upper bound will be is 
reached beyond which satisfactory operation is not pos- 
sible. Furthermore, the clocking of elements within a cir- 
cuit also results in power dissipation which is an impor- 
tant factor when considering apparatus driven by a lim- 
ited power supply, such as that provided by a battery. 20 

A known approach towards reducing power con- 
sumption is to effectively stop the operation of the clock 
when processing within the circuit is not required. The 
chip is said to be placed in its idle mode and additional 
components are required in order to selectively place 2s 
the circuit in one of its operational modes; that is in its 
active power dissipating mode or its reduced power dis- 
sipation idle mode. A problem with this approach is that 
the stopping of the clock results in a total shut down of 
the circuit therefore periods during which the clock may 30 
actually be stopped are limited and the power saving 
may be off-set by the additional measures that must be 
taken when selecting periods of shut down. 

According to a first aspect of the invention there is 
provided a data processing circuit comprising a plurality 3S 
of elements, wherein said elements operate in response 
to decoded instructions while receiving clocking signals, 
comprising means for identifying instruction types; and 
means for enabling or disabling clocking signals to at 
least one of said elements in dependence upon whether 40 
said element is required for the execution of said instruc- 
tion type. 

According to a second aspect of the present inven- 
tion, there is provided a data processing circutt compris- 
ing a plurality of elements, wherein said elements oper- 45 
ate in response to decoded instructions while receiving 
clocking signals, comprising means for identifying in- 
struction types, and means for enabling or disabling at 
least one of said elements in dependence upon whether 
said element is required for the execution of said instruc- so 
tion type. 

In a preferred embodiment, the elements include a 
data processing unit and a data address generating unit 
and the data processing unit may include more than one 
multiplier and more than one arithmetic unit with co-op- ss 
erating multiplexed circuits. 

Preferably, the means for identifying instruction 
types is an instruction decode unit having an instruction 



type decoder and the instruction type decoder may be 
configured to gate clock signals. Preferably, gated clock 
signals and input instructions are supplied to a plurality 
of instruction decoders. Optionally an instruction type 
decoder is configured to enable or disable instruction 
decoders for respective instruction types, and the in- 
struction decoders are configured to gate clock signals. 

According to a third aspect of the present invention, 
there is provided a method of processing data, wherein 
a plurality of elements within a data processing circuit 
operate in response to decoded instructions while re- 
ceiving clocking signals, comprising steps of identifying 
instruction type; and enabling or disabling clocking sig- 
nals to at least one of said elements in dependence up- 
on whether said element is required for the execution of 
said instruction type. 

According to a fourth aspect of the present invention 
there is provided a method of processing data, wherein 
a plurality of elements within a data processing circuit 
operate in response to decoded instructions while re- 
ceiving clocking signals, comprising steps of identifying 
instruction types, and enabling or disabling at least one 
of said elements in dependence upon whether said el- 
ement is required for the execution of said instruction 
type. 

The invention will now be described by way of ex- 
ample only with reference to the accompanying draw- 
ings in which: 

Figure 1 shows a mobile telephone having digital 
signal processing circuitry; 

Figure 2 shows the functionality of digital signal 
processing circuitry within the mobile telephone 
shown in Figure 1, including a digital signal proces- 
sor; 

Figure 3 details a digital signal processor of the type 
identified in Figure 2, including an instruction de- 
code unit and a data processing unit; 
Figure 4 details the instruction decode unit shown 
in Figure 3, including an instruction type decoder; 
Figure 5 details the instruction type decoder identi- 
fied in Figure 4; 

Figure 6 details the data processing unit identified 
in Figure 3, having two multipliers, two anthmetic 
logic units, a compare and select unit and a normal- 
isation unit to facilitate the processing of floating 
point numbers; 

Figure 7 shows a block of floating point numbers; 
Figure 8 shows a block of floating point numbers 
similar to the block shown in Figure 7, normalised 
so as to have matching exponent values; 
Figure 9 details one of the multipliers shown in Fig- 
ure 6; 

Figure 10 details one of the arithmetic logic units 
shown in Figure 6; 

Figure 1 1 details the compare and select unit shown 
in Figure 6; and 

Figure 12 details the normalising unit shown in Fig- 
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ure 6. 

The present invention has many applications par- 
ticularly in systems where power consumption is to be 
minimised. As an example of such an environment, the $ 
invention will be described with respect to a mobile tel- 
ephone, although it should be appreciated that many 
other applications are also relevant. 

A mobile telephone is shown in Figure 1 configured 
to operate digitally in accordance with the GSM recom- 10 
mendations. The telephone includes a microphone 101 
arranged to receive speech utterances in combination 
wrth a loudspeaker 102 arranged to supply received 
voice signals to an operator's ear. The telephone in- 
cludes manually operable buttons 1 03 and a visual dis- *5 
play 104. 

The mobile telephone shown in Figure 1 includes a 
digital data processing circuit 201 as shown in Figure 2. 
The processing circuit receives digital input signals from 
rtn rtnri toque io digital converter 202 which in turn re- 20 
ccivcb mirtogjc speech signals from microphone 101. 
Wiihm inc dntrt processing circuit 201, digital signals 
tfom ihc rtr^*cquo to digital converter 202 are supplied 
to .1 cpeocf crcodmg subsystem 203 which in turn sup- 
pi enrooca speech to a channel coding and redun- 2S 
amc^ cube yc cm 204. Loudspeaker 102 receives an 
output horr th ? o-iia processing circuit 201 via a digital 
to ^n-iioquc converter 205. Radio antenna 206 receives 
ifHn<yni<^«vi Mrjrnls from a transmitter 207 which is in 
turn rtrrnnryxi 10 receive outputs from the channel cod- 30 
tng subsystem 204 The antenna 206 is also shared with 
a receiving circuit 208 which in turn supplies input sig- 
nals 10 a channel reconstruction and decoding subsys- 
tem 209 withm ihe data processing circuit 201 . The out- 
put from subsystem 209 is supplied to a speech decod- -35 
mg subsystem 21 0 which in turn provides a digital 
speech signal to the digital to analogue converter 205. 

The data processing circuit 201 is implemented as 
a programmable device capable of multiplexing proc- 
esses 203 204. 209 and 210. A hardware implementa- *o 
tion of the processing circuit 201 is shown in Figure 3, 
and includes a program memory 301 and a data mem- 
ory 302. Processing circuit 201 is event driven which is 
to say that the circuit will be placed in an idle mode when 
no interrupts are pending. Interrupt signals are supplied *$ 
to an interrupt handler 303 by interrupt lines 304. Upon 
receiving an interrupt, the interrupt handler supplies an 
enabling signal over a line 305 to a phase locked loop 
circuit 306. The phase locked loop circuit receives clock- 
ing signals from a system clock over a line 307 and is so 
arranged to supply higher frequency clock signals to a 
clock generator 308 over a line 309. 

In conventional systems, the clock generator 308 
will be arranged to supply clocking signals to virtually all 
of the active elements within the circuit. However, in ac- ss 
cordance with the present invention, an instruction de- 
code unit 320 provides clock enable signals to a clock 
generator 308 via a line 319. This controls the supply of 



the clocking signals to the active elements within the 
system. 

The interrupt handling circuit 303 also supplies in- 
terrupt signals to an instruction fetch unit 312 over line 
309 which in turn generates an address to the program 
memory 301 over an address bus 31 3. The resulting in- 
struction read from program memory is returned to the 
instruction decode unit 320 over an instruction bus 314. 
The instruction decode unit 320 decodes the instruction 
read from program memory 301 and supplies appropri- 
ate control signals to a data processing unit 315 and a 
data address generation unit 316, as well as other units. 
The data address generation unit 316 addresses the da- 
ta memory via an address bus 31 7 and the communica- 
tion of data between the data memory 302 and the data 
processing unit 31 5 or the data address generating unit 
316 occurs via a data bus 318. 

The instruction decode and clock generator unit 31 0 
is detailed in Figure 4 in which instructions received from 
the program memory 301 via the instruction bus 314 ara 
supplied to an instruction register 401 and an instruction 
type decoder 402. The instruction register 401 initiates' 
the start of instruction decoding such that an encoded 
instruction will result in the appropriate control signals 
being supplied to the data processing unit 315, along 
with any other required components. 

The instruction decode and clock generator unit 310 
also includes a plurality of decoders for instruction type 
403, 404, 405 and 406. An output from the instruction 
type decoder 402 is supplied to an instruction type reg- 
ister 407, thereby equalising the delay introduced by the 
instruction register 401. The instruction type register 
403 also receives the clocking signal generated by clock 
generator 308 on line 311 , which is also supplied to the 
instruction register 401 and to each of the decoders for 
instruction type 403 to 406. Similarly, the output from the 
instruction type register 407 and the output from the in- 
struction register 401 are supplied to each of the decod- 
ers for instruction type 403 to 406. 

The decoders for instruction type 403 to 406 are 
each arranged to convey clocking signals and control 
signals to respective parts of the data processing circuit. 
Thus, in addition to distributing control signals, the in- 
struction decode and clock generator unit 310 is also 
responsible for distributing clocking signals. Further- 
more, these clocking signals are gated such that, for a 
particular instruction, clock signals are only supplied to 
relevant parts o( the circuit so that parts of the circuit 
that are not required to implement a particular instruc- 
tion are not active and are therefore not consuming pow- 
er unnecessarily. 

Instruction type decoder 403 is detailed in Figure 5. 
The decoder includes a multiplexer 501 , a decoder 502 
and a clock switch 503. The output from the instruction 
type register 407 comprises a multi-bit bus and one line 
from said bus is supplied to each of the decoders 403 
to 406, indicating that a particular decoder is to be en- 
abled. Thus, this enabling signal is supplied to the mul- 
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tiplexer 501 which, when enabled, allows data from the 
instruction register 401 to be conveyed to the decoder 
502. Alternatively if the multiplexer 501 does not receive 
an enabling signal from the instruction type register 407, 
a "no operation" signal is conveyed to the decoder 502, 
indicating that this part of the circuit is not required to 
perform an operation during the clock cycle under con- 
sideration. 

If an instruction has been conveyed via multiplexer 
501 to decoder 502, the instruction is decoded at the 
decoder 502 resulting in control signals being supplied 
over control lines 504. In addition, an enabling signal is 
supplied to clock switch 503 over line 505 resulting in 
the processor clock signal on line 311 being gated to 
gated clock signal lines 506. Thus, when active control 
signals are supplied on control signal lines 504, appro- 
priate clock signals are distributed throughout the circuit 
over lines 506. 

However, when these elements do not require to 
operate in response to control signals on line 504, the 
clock switch is switched off, clock signals are disabled 
and the relevant circuit elements are left in an idle state. 
Thus, although the components are receiving power, 
they are not actually being clocked. The circuit elements 
are designed such that power consumption is almost ze- 
ro when placed in a non -clocking state, possibly by im- 
plementation using CMOS technology. 

Data processing unit 31 5 is detailed in Figure 6. The 
data processing unit is arranged to perform multiplica- 
tion operations by providing a first multiplier 601 and a 
second multiplier 602. Each multiplier provides a 34 bit 
output over respective buses 603 and 604 to respective 
arithmetic logic units, consisting of a first arithmetic logic 
unit 605, arranged to receive data from bus 603 and a 
second arithmetic logic unit 606 arranged to receive da- 
ta from bus 604. 

Data bus 407 provides for data transmission be- 
tween the data processing unit 405 and the data mem- 
ory 402 shown in Figure 4. Returning to Figure 6, this 
data bus is implemented as two 16 bit buses a first of 
which, first data bus 607, facilitates reading from the da- 
ta memory 402 and writing to said data memory. A sec- 
ond data bus 608 is similar to data bus 607 but is only 
configured to read from the data memory 402. 

The data processing unit includes an output multi- 
plexer 609 configured to allow a plurality of output sig- 
nals to be selectively returned to the first data bus 607 
or to the data address generation unit 406. 

A full clock cycle may be employed to allow the 
transfer of data from the data memory, given that each 
transfer to the data processing unit 405 is buffered by 
one of eight registers 610 to 617. The processing unit 
also includes a compare and select unit 61 8 and a nor- 
malising unit 6 1 9 all of the unit is clocked from a common 
clock source. However, in order to effectively double the 
speed of operation, devices to the left of the compare 
and select unit 618 are clocked on the rising edge of the 
clock source with devices to the right of the compare 



and select unit 618 being clocked on the falling edge of 
the clock source. The compare and select unit 618 is 
required to operate for both halves of the unit and is 
therefore clocked on both the rising and falling edges of 
s the clock source. Input register 610 receives data from 
the first data bus 607 and. upon receiving a rising clock 
edge, clocks 16 bit data to the normalising unit 619 over 
a bus 620. Input register 611 clocks data from the sec- 
ond data bus 603 on a rising edge to, the first multiplier 
'0 601 and to the first arithmetic logic unit 605 over bus 
621. Input register 612 clocks data from the first data 
bus 607 on a rising edge to the first multiplier 601 over 
bus 622. Input register 61 3 clocks data from the second 
data bus 608 on a rising edge to the first arithmetic logic 
*5 unit 605 over bus 623. 

Input register 614 clocks data from the first data bus 
607 on a falling edge to the second arithmetic logic unit 
606 over a bus 624. Input register 61 5 clocks data from 
the second data bus 608 on a falling edge to the second 
multiplier 602 and to the second arithmetic logic unit 606 
via bus 625. Input register 616 clocks data from the first 
data bus 607 on a falling edge to the second multiplier 
602 and to the second arithmetic logic unit 606 over a 
bus 625. Input register 61 7 clocks data from the second 
data bus 608 on a falling edge to the second arithmetic 
logic unit 606 via bus 626. 

The output from the first multiplier 601 , supplied to 
the first arithmetic logic unit 605 over bus 603. is also 
supplied over a similar bus 626 to the output multiplexer 
609. The output from the arithmetic logic unit 605 is re- 
turned to an input of the first multiplier 601 via a bus 627. 
which also supplies an output to the output multiplexer 
609. Similarly, this output is also returned to an input of 
the second arithmetic unit, effectively as an extension 
of the same bus. 

Ten bit bus 628 and single bit lines 629 and 630 sup- 
ply data to sequencer 403 to facilitate conditional jumps. 
The second arithmetic logic unit 606 cannot instigate a 
conditional jump but overflow conditions may be identi- 
fied over two control lines 631 . The output from the sec- 
ond arithmetic logic unit 606 is returned to an input of 
the first arithmetic logic unit 605 via a bus 632, which 
also supplies said output to the output mutliplexer 609 
and an input to the second multiplier 602. An output from 
the first arithmetic logic unit 605 is supplied to the com- 
pare and select unit 618 over a 32 bit bus 633 and a 
similar output is supplied to the compare and select unit 
61 8 from the second arithmetic logic unit 606 over a 32 
bit bus 634. 

Normalising unit 619 produces an output exponent 
which is supplied, via a bus 635 to the output multiplexer 
609 and to the compare and select unit 618. The nor- 
malising unit 619 generates shifting signals which are 
supplied to the first arithmetic logic unit 605 via shift bus- 
es 636 and 637. A 42 bit output is supplied from the first 
arithmetic logic unit 605 to the normalising unit 619 via 
a bus 638, while a bus 639 supplies an output from the 
compare and select unit 61 8 to the normalising unit 619 
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and to the output multiplexer 609. The data on bus 639 
relates to the left side rising edge subcycle and a similar 
output from the compare and select unit concerning the 
right side falling edge subcycle is supplied to the output 
multiplexer 609 over a bus 640. 

For the architecture shown in Figure 6 and de- 
scribed in relation to a specific embodiment of the in- 
vention, arithmetic logic units 605, 606 provide respec- 
tive exponent values from registers 611, 612, 615, 616 
to compare and select unit 618. Optionally, an architec- 
ture may be configured such that registers 611, 612, 
615, 616 provide exponent values directly to compare 
a select unit 618. 

A compare and select unit 618 operates together 
with both the first arithmetic logic unit and the second 
arithmetic logic unit and in doing so it may compare a 
pair of exponent values whereafter one of said pair may 
be stored, dependent upon the result of the comparison. 
In this way, an extreme example taken from a plurality 
of exponent values, usually the largest exponent value 
is stored by the compare and select unit so as to facili- 
tate subsequent processing and to significantly reduce 
the operational overhead of the arithmetic logic units. In 
particular, this stored, value may be supplied over bus 
639 to the normalising unit 619. In this way, the normal- 
ising unit may supply shifting signals to the arithmetic 
logic unit, in response to comparisons made by the com- 
pare and select unit, so as to normalise respective man- 
tissa associated with exponents of floating point num- 
bers considered by the compare and select unit. 

In Figure 7 a block of floating point numbers is rep- 
resented as comprising floating point number 701 , float- 
ing point number 702, floating point number 703 and 
floating point number 704. It should be understood that 
a large block of floating point numbers is often encoun- 
tered in data processing. Each floating point number 
consists of a mantissa and an exponent portion. Typi- 
cally, a block of floating point numbers of this type will 
result from a processing algorithm which is used in one 
of the processors 203, 204, 210 or 209 shown in Figure 
2 operating in the mobile phone. The exponent of float- 
ing point number 701 f and the exponent of floating point 
number 702 are similar but not the same. Furthermore 
the other exponents in the block of floating point num- 
bers are likely to be similar but not identical. In order to 
perform efficient data processing on a block of floating 
point numbers, it is preferable to have matching expo- 
nents throughout the entire block of numbers. 

The arrangement shown in Figure 6 facilitates effi- 
cient translation of a block of floating point numbers of 
the type shown in Figure 7, into a block of floating point 
number of the type shown in Figure 8, characterised by 
having identical exponents. There are two stages to the 
process of conversion from the block of floating point 
numbers represented in Figure 7 to the block of floating 
point numbers shown in Figure 8. The first process may 
be performed during generation of the floating point 
number block, or through a subsequent pass through 



the block of floating point numbers, during which expo- 
nents are compared and the greatest exponent is re- 
tained. Under program control, exponents are supplied 
to the compare and select unit 618. which is capable of 
5 performing comparisons between an incoming expo- 
nent and a retained high value exponent from previous 
comparisons. Thereby the exponents in a block of float- 
ing point numbers may be compared, and a highest ex- 
ponent retained. 
*o The second stage of the process, of converting from 
the block of floating point numbers shown in Figure 7 to 
the block of floating point numbers shown in Figure 3, 
is performed by the normalising unit 619. The highest 
value exponent which has been determined by the com- 

'5 pare and select unit in the first process is retained as a 
number which is transferred over bus 639 to the normal- 
ising unit 619. The normalising unit receives exponents 
for each number in the block of floating point numbers 
to be converted and performs a subtraction between the 

20 highest value exponent and the exponents for each 
number in the block, such that the number of shrfts of 
the mantissa that is required for each floating point 
number to be normalised to the highest exponent value 
is determined and supplied to the first arithmetic logic 

2B unit 605. The first arithmetic logic unit 605 includes a 
shifting circuit, which is capable of shifting the mantissa 
of the floating point number the desired number of binary 
places in a single cycle. In this way, in a two stage proc- 
ess, the block of floating point numbers shown in Figure 

30 7 is converted into the block of floating point number 
shown in Figure 8, represented as floating point number 
801 r 802. 303 and 804. Thereby facilitating the manip- 
ulation of a block of normalised floating point numbers 
in the fixed point arithmetic logic unit 605. 

3S The multiplier 601 shown in Figure 6 is detailed in 
Figure 9. A first signed/unsigned multiplexer 901 re- 
ceives control signals 91 1 from the sequencer 403. Data 
signals are received by multiplexer 901 from the data 
bus 621 , which may then be considered as signed or 

40 unsigned under control of the sequencer 403. A similar 
arrangement occurs with a signed/unsigned multplexer 
902 which receives signals from the data bus 622. A 
multiplexer 903 supplies A and B signals to a 1 7 bit two's 
compliment multipler 904. The product of this multipli- 
es cation is supplied to a multiplier shift/round unit 905 
which is controlled by control signals 912 from the se- 
quencer 403. The output of the multiplier shift/round unit 
905 is supplied to a multiplier register 906 which is con- 
trolled by sequencer 403 via control lines 913. The out- 

so put of multiplier register 906 forms the bus 603, which 
may be supplied as an input to the multiplexer 903, thus 
facilitating feedback of multiplication products within the 
multiplier unit 601 itself. The multiplexer 903 may also 
receive predetermined fixed zero values as either or 

55 both of the multiplication operands, such that circuitry 
within the multiplier itself may be set to a static value, 
thereby reducing power consumption. 

The first arithmetic logic unit 605 shown in Figure 6 
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is detailed in Figure 10. A multiplexer 1001 selects sig- 
nals from bus 621 . bus 603. bus 632, bus 622 or numeric 
value zero. Multiplexer 1001 is controlled by control sig- 
nals supplied from the sequencer 403. An arithmetic log- 
ic circuit 1002 is controlled by control lines 1102 which 
are received from the sequencer 403. The arithmetic 
logic circuit 1002 has an numeric output 1203 which is 
supplied to an arithmetic logic shifter 1003. The arith- 
metic logic circuit 1002 further supplies condition signals 
to condition logic 1008, 1009, which facilitate conditional 
instructions, and exception condition detection. 

The arithmetic logic shifter 1003 is controlled via 
control logic 1004, by control lines 1103, which are sup- 
plied from the sequencer 403. Control logic 1004 is also 
supplied by buses 636 and 637, which may be used to 
determine the number of binary places which the shifter 
should impose upon a binary number received via bus 
1203 The output of the arithmetic logic shifter 1003 is 
supplied to a rounding unit 1005, which is controlled by 
a control line 1104 from sequencer 403. The rounding 
unit 1005 ia> u;>cd to convert highly precise but long bi- 
nary nu"nbcrs having 42 bits into shorter representa- 
tions ol srt»d njmoers, in order that they may be stored 
cf'icicntry n memory and processed by other parts of 
the a it. i processing unit which have input word lengths 
ol iocs trv«n 42 bits. The rounding unit also supplies a 
cond ton signal to condition detecting logic 1010, and a 
Saturn lino, Circuit 1006. 

The s^tufHimg circuit 1006 receives the numerical 
ouiput from the rounding circuit 1005. The saturating cir- 
cuit 1 006 is Hblc to perform a limiting function on numer- 
ical values which exceed a predetermined limit, again 
in order that large binary numbers may be reduced ef- 
ficiently to a suitable word length for storage or subse- 
quent processing. The saturation circuit 1006 is control- 
led by control line 1105 from sequencer 403. The satu- 
ration circuit may further be controlled by acondition sig- 
nal from the arithmetic logic circuit 1002, which may in- 
dicate when a particular arithmetic logic operation has 
resulted in a numerical value which must be saturated. 

The output of the saturating unit 1006 is supplied to 
an accumulator register 1007, which is clocked by a 
clock signal 1106. The accumulator register 1007 sup- 
plies signals to buses 627 and 628. Thereby it is possi- 
ble tor the accumulator register 1 007 to supply a numer- 
ical value back to the multiplexer 1001 , for further arith- 
metic logic operations. 

A condition generation block 1011 receives numer- 
ical values from the arithmetic logic shifter 1003, and 
condition signalling from signalling logic 1008 : 1009, 
1010 and 1012. Signals 1012 are supplied from the 
compare and select unit 618. The condition generation 
block 1011 also receives signals from the bus 631 . The 
condition generation unit supplies signals 629 and 630. 

The compare and select unit 618 shown in Figure 
6 is detailed in Figure 11 . A low order word input multi- 
plexer 1151 receives input values from bus 633 or bus 
634. Alternatively, a zero value may be used as a nu- 



merical input to this multiplexer. A high order word input 
multiplexer 1 1 52 may also receive signals from bus 633, 
or bus 634, or a numerical value zero. Both input word 
multiplexers 1151 and 11 52 are controlled by control sig- 
5 nals 1161 and 1162 which are supplied from the se- 
quencer 403. The low order input word multiplexer 1151 
may supply numerical values to a compare unit 1153 or 
an additional multiplexer 1154. The high order input 
word multiplexer 1152 supplies its numerical value to a 
w second input of the compare unit 1153, and a second 
input of the additional multiplexer 1 1 54. A compare unit 
1 1 53 generates an indication as to whether the low order 
word input from multiplexer 1151 is greater than or equal 
to the high order word input supplied from multiplexer 
'5 1152. This indication is supplied as a signal to a maxi- 
mum or minimum plurality inversion circuit 1155 which 
generates a compare signal, which is in turn supplied to 
the additional multiplexer 1 1 54 or a compare select sta- 
tus register 1 1 56. 

20 The high order input word multiplexer 1 1 52 is further 
capable receiving as an alternate input the output expo- 
nent from the normalising unit 6 1 9, represented as a bus 
635. The output of the additional multiplexer 1 1 54 is sup- 
plied as an input to a compare and select register 11 57, 

2S its contents typically represent the highest of a se- 
quence of compared exponents in a block floating point 
exponent comparison procedure. In order to facilitate 
such a procedure, the output of the compare and select 
register 1157 must be supplied as a numerical value 

30 ever an internal bus 1199 via the low order word input 
multiplexer 1151 to the first input of the compare unit 
11 53. Then, as exponents are supplied to an input of the 
high order word input multiplexer 1152, comparisons 
may be made such that the highest of a sequence of 

3S compared exponents will end up being stored in the 
compare and select register 1157. 

The compare and select status register 1156 sup- 
plies numerical values to buses 639 and 640. This reg- 
ister also receives a clock signal 1165 from control cir- 

40 cuitry. The compare and select register 1 1 57 is clocked 
by a signal 1166 from control circuitry. 

The normalising unit 619 shown in Figure 6 is de- 
tailed in Figure 12. The bus 638 supplies 42 bit numer- 
ical values to a unit 1271 which determines the most 

*s significant bit of the numerical value, and encodes this 
as a 6 bit value supplied over bus 637. This 6 bit value 
637 is further supplied to a multiplexer 1 272 which may 
alternately receive a 1 6 bit numerical value from the bus 
639 connected to the compare and select unit 618. The 

so 16 bit output of the multiplexer 1272 is supplied as a 
subtracting input to a subtractor 1273. The multiplexer 
1272 is controlled by a control signal 1281. An additional 
input to the subtractor 1273 is supplied from bus 620. 
The output of the subtractor 1273 is supplied to an out- 

ss put exponent register 1274, which is controlled by a 
clock signal 1282 supplied from control circuitry. Six bits 
of the output from the subtractor 1273 are supplied as 
bus 636. The output of the output exponent register 
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1 274 is supplied as a 16 bit numerical value to the bus 

635. 

In view of the foregoing description it will be evident 
to a person skilled in the art that various modifications 
may be made within the scope of the invention. s 

For example, in an alternative embodiment the in- 
struction type decoder 402 and the instruction type reg- 
ister 407 are removed from the instruction decode unit 
310. The new instruction decode unit 310 is comprised 
of the instruction register 401 and an instruction decod- 10 2. 
er. As in the previous embodiment instructions received 
from the program memory 301 are supplied to the in- 
struction register 401 via the instruction bus 314. The 
output from the instruction register 401 is supplied to the 
instruction decoder. The instruction decoder comprises *5 
the same components as the decoder for instruction 
type 403 with the exception that the multiplexer 501 is 
removed and the instruction from the instruction register 
401 is conveyed directly into the decoder 502. The de- 
coder decodes the instruction as described previously. 20 
Thus only those elements of the processor required for 3. 
execution of the instruction receive active control sig- 
nals and appropriate -clock signals. Current consump- 
tion in the remaining. circuit elements is almost zero, 
when implemented using a technology such as CMOS. 2$ 4. 
Th is embodiment is smaller and hence cheaper than the 
previous embodiment but consumes more power as the 
whole instruction decode unit is active during an instruc- 
tion decode This embodiment is an extreme example 
of the trade-off between the complexity of the instruction 30 5. 
type decoder and the level of power consumption in the 
instruction decode unit. A person skilled in the art can 
determine what level of complexity in the instruction type 
decoder is appropriate for a particular application. The 
scope of the present disclosure includes any novel tea- 3S 6. 
ture or combination of features disclosed therein either 
explicitly or implicitly or any generalisation thereof irre- 
spective of whether or not it relates to the claimed in- 
vention or mitigates any or all of the problems addressed 7. 
by the present invention. The applicant hereby gives no- «o 
tice that new claims may be formulated to such features 
during prosecution of this application or of any such fur- 
ther application derived therefrom. 8. 

The scope of the present disclosure includes any 
novel feature or combination of features disclosed *s 
therein either explicitly or implicitly or any generalisation 
thereof irrespective of whether or not it relates to the 
claimed invention or mitigates any or all of the problems 9. 
addressed by the present invention. The applicant here- 
by gives notice that new claims may be formulated to so 
such features during prosecution of this application or 
of any such further application derived therefrom. 10. 



Claims ss 

1. A data processing circuit comprising a plurality of 
elements, wherein said elements operate in re- 



sponse to decoded instructions while receiving 
clocking signals, comprising 

means for identifying instruction types: and 
means for enabling or disabling clocking sig- 
nals to at least one of said elements in depend- 
ence upon whether said element is required for 
the execution of said instruction type. 

A data processing circuit comprising a plurality of 
elements, wherein said elements operate in re- 
sponse to decoded instructions while receiving 
clocking signals, comprising 

means for identifying instruction types; and 
means for enabling or disabling at least one of 
said elements in dependence upon whether 
said element is required for the execution of 
said instruction type. 

A data processing circuit according to claim" 1 or 
claim 2, wherein said elements include a -data- 
processing unit and a data address generating unit. 

A data processing circuit according to claim 3, 
wherein data processing unit includes more than 
one multiplier and more than one arithmetic unit 
with co-operating multiplexed circuits. 

A data processing circuit according to any of claims 
1 to 4, wherein said means for identifying instruction 
types is an instruction decode unit having an in- 
struction type decoder. 

A data processing circuit according to claim 5, 
wherein said instruction type decoder is configured 
to enable gating of clock signals. 

A data processing circuit according to claim 6, 
wherein gated clock signals and input instructions 
are supplied to a plurality of instruction decoders. 

A data processing circuit according to claim 5, 
wherein said instruction type decoder is configured 
to enable or disable instruction decoders for respec- 
tive instruction types. 

A data processing circuit according to claim 8, 
wherein said instruction decoders are configured to 
gate clock signals. 

A method of processing data, wherein a plurality of 
elements within a data processing circuit operate in 
response to decoded instructions while receiving 
clocking signals, comprising steps of 

identifying instruction types; and 

enabling or disabling clocking signals to at least 
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one of said elements in dependence upon 
whether said element is required for the execu- 
tion of said instruction type. 

11. A method of processing data, wherein a plurality of s 
elements within a data processing circuit operate in 
response to decoded instructions while receiving 
clocking signals, comprising steps of 

identifying instruction types; and 10 
enabling or disabling at least one of said ele- 
ments in dependence upon whether said ele- 
ment is required for the execution of said in- 
struction type. 

15 

12. A method of processing data according to claim 10, 
wherein the enabling or disabling of clocking signals 
is effected with respect to clocking signals supplied 
to a data processing unit and to a data address gen- 
erating unit. 20 

1 3. A method of processing data according to claim 11, 
wherein the enabling or disabling of at least one of 
said elements is effected with respect to control sig- 
nals supplied to a data processing unit and to a data 2S 
address generating unit. 

1 4. A method according to claim 1 2 or claim 1 3, wherein 
said data processing unit is configured to perform 
multiplication functions and arithmetic functions, 30 
implemented by more than one multiplier and more 
than one arithmetic unit with co-operating multi- 
plexed circuits. 

15. A method according to any of claims 10 to 14, 3$ 
wherein instruction types are identified by means of 

an instruction decode unit having an instruction type 
decoder. 

16. A method according to claim 15, wherein clock sig- *o 
nals are gated by means of said instruction type de- 
coder. 

17. A method according to claim 16, wherein gated 
clock signals and input instructions are supplied to 4 $ 
a plurality of instruction decoders. 

18. A method according to claim 15, wherein said in- 
struction type decoder is configured to enable or 
disable instruction decoders for respective instruc- so 
tion types. 



19. A method according to claim 18, wherein said in- 
struction decoders are configured to gate clock sig- 
nals, ss 
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Fig.6. 
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Fig. 9. 
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